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Abstract. Succinctness is a natural measure for comparing the strength of different log- 
ics. Intuitively, a logic Li is more succinct than another logic L2 if all properties that can 
be expressed in L2 can be expressed in Li by formulas of (approximately) the same size, 
but some properties can be expressed in Li by (significantly) smaller formulas. 

We study the succinctness of logics on linear orders. Our first theorem is concerned 
with the finite variable fragments of first-order logic. We prove that: 

(i) Up to a polynomial factor, the 2- and the 3-variable fragments of first-order logic on 
linear orders have the same succinctness. 

(ii) The 4- variable fragment is exponentially more succinct than the 3-variable fragment. 
Our second main result compares the succinctness of first-order logic on linear orders 

with that of monadic second-order logic. We prove that the fragment of monadic second- 
order logic that has the same expressiveness as first-order logic on linear orders is non- 
elementarily more succinct than first-order logic. 



It is one of the fundamental themes of logic in computer science to study and compare 
the strength of various logics. Maybe the most natural measure of strength is the expressive 
power of a logic. By now, researchers from finite model theory, but also from more applica- 
tion driven areas such as database theory and automated verification, have developed a rich 
toolkit that has led to a good understanding of the expressive power of the fundamental 
logics (e.g. iniElEl)- It should also be said that there are clear limits to the understanding 
of expressive power, which are often linked to open problems in complexity theory. 

In several interesting situations, however, one encounters different logics of the same 
expressive power. As an example, let us consider node selecting query languages for XML- 
documents. Here the natural deductive query language monadic datalog [7| and various 
automata based query "languages" |13[ I14| have the same expressive power as monadic 
second-order logic. XML-documents are usually modelled by labelled trees. Logics on trees 
and strings also play an important role in automated verification. Of the logics studied in 
the context of verification, the modal /z-calculus is another logic that has the same expressive 
power as monadic second-order logic on ranked trees and strings, and linear time temporal 
logic LTL has the same expressive power as first-order logic on strings jllj . 
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Succinctness is a natural measure for comparing the strength of logics that have the 
same expressive power. Intuitively, a logic Li is more succinct than another logic L2 if all 
properties that can be expressed in L2 can be expressed in Li by formulas of (approximately) 
the same size, but some properties can be expressed in Li by (significantly) smaller formulas. 

For both expressiveness and succinctness there is a trade-off between the strength of a 
logic and the complexity of evaluating formulas of the logic. The difference lies in the way 
the complexity is measured. Expressiveness is related to data complexity, which only takes 
into account the size of the structure in which the formula has to be evaluated, whereas 
succinctness is related to the combined complexity, which takes into account both the size 
of the formula and the structure ^7] . 

Succinctness has received surprisingly little attention so far. A few scattered results are 
jl8l [^[TllHll5j: for example, it is known that first-order logic on strings is non-elementarily 
more succinct than LTL |lllll5j . In [S], we started a more systematic investigation. Specifi- 
cally, we studied the succinctness of various logics on trees that all have the same expressive 
power as monadic second-order logic. While we were able to gain a reasonable picture of 
the succinctness of these logics, it also became clear that we are far from a thorough under- 
standing of succinctness. In particular, very few techniques for proving lower bounds are 
available. 

Most of the lower bound proofs use automata theoretic arguments, often combined 
with a clever encoding of large natural numbers that goes back to Stockmeyer ^^l- In |Hji 
these techniques were also combined with complexity theoretic reductions to prove lower 
bounds on succinctness under certain complexity theoretic assumptions. Wilke used 
refined automata theoretic arguments to prove that CTL"'' is exponentially more succinct 
than CTL. Adler and Immerman |^ were able to improve Wilke's lower bound slightly, 
but what is more important is that they introduced games for establishing lower bounds on 
succinctness. These games vaguely resemble Ehrenfeucht-Frai'sse games, which are probably 
the most important tools for establishing inexpressibility results. 

In this paper, we study the succinctness of logics on linear orders (without any additional 
structure). In particular, we consider finite variable fragments of first-order logic. It is 
known and easy to see that even the 2-variable fragment has the same expressive power as 
full first-order logic on linear orders (with respect to Boolean and unary queries). We prove 
the following theorem: 

Theorem 1.1. (i) Up to a polynomial factor, the 2 and the 3- variable fragments of 
first-order logic on linear orders have the same succinctness, 
(ii) The 4-variable fragment of first-order logic on linear orders is exponentially more 
succinct than the 3- variable fragment. □ 

For the sake of completeness, let us also mention that full first-order logic is at most expo- 
nentially more succinct than the 3- variable fragment. It remains an open problem if there 
is also an exponential gap in succinctness between full first-order logic and the 4-variable 
fragment. 

Of course the main result here is the exponential gap in Theorem 11.11 (ii) , but it should 
be noted that (i) is also by no means obvious. The theorem may seem very technical and 
not very impressive at first sight, but we believe that to gain a deeper understanding of the 
issue of succinctness it is of fundamental importance to master basic problems such as those 
we consider here first (similar, maybe, to basic inexpressibility results such as the inability 
of first-order logic to express that a linear order has even length). The main technical 
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result behind both parts of the theorem is that a 3-variable first-order formula stating that 
a linear order has length m must have size at least ^\/Tn. Our technique for proving this 
result originated in the Adler-Immerman games, even though later it turned out that the 
proofs are clearer if the reference to the game is dropped. 

There is another reason the gap in succinctness between the 3- and 4- variable fragments 
is interesting: It is a long standing open problem in finite model theory if, for k > 3, the 
k variable fragment of first-order logic is strictly less expressive than the (A:+l)-variable 
fragment on the class of all ordered finite structures. This question is still open for all 
A; > 3. Our result (ii) at least shows that there are properties that require exponentially 
larger 3-variable than 4- variable formulas. 

Succinctness as a measure for comparing the strength of logics is not restricted to logics 
of the same expressive power. Even if a logic Li is more expressive than a logic L2, it is 
interesting to know whether those properties that can be expressed in both Li and L2 
can be expressed more succinctly in one of the logics. Sometimes, this may even be more 
important than the fact that some esoteric property is expressible in Li, but not L2. We 
compare first-order logic with the more expressive monadic second-order logic and prove: 

Theorem 1.2. The fragment of monadic second-order logic that has the same expressive- 
ness as first-order logic on linear orders is non-elementarily more succinct than first-order 
logic. □ 

The paper is organised as follows: After the Preliminaries, in Section |31 we prove the 
main technical result behind Theorem ll.il In Sections 0] and El we formally state and prove 
the two parts of the theorem. Finally, Section El is devoted to Theorem 11.21 

The present paper is the full version of the conference contribution 

2. Preliminaries 

We write N for the set of non-negative integers. 
We assume that the reader is familiar with first-order logic FO (cf., e.g., the textbooks 
Ell)- ^ natural number k we write FO'^ to denote the k-variable fragment of 

FO. The three variables available in FO^ will always be denoted x, y, and z. We write 
F0^(<, succ, min, max) (resp., F0^(<)) to denote the class of all FO^-formulas of signature 
{<, succ, min, max} (resp., of signature {<}), with binary relation symbols < and succ and 
constant symbols min and max. In the present paper, such formulas will always be inter- 
preted in finite structures where < is a linear ordering, succ the successor relation associated 
with <, and min and max the minimum and maximum elements w.r.t. <. 

For every G N let An be the {<,succ, min, max}-structure with universe {0, . . , N}, < 
the natural linear ordering, min"^^ = 0, maxr^^ = N, and succ the relation with (a, b) G succ 
iff a+1 = b. We identify the class of linear orders with the set {An ■ A G N}. 

For a structure A we write U'^ to denote ^'s universe. When considering FO^, an 
interpretation is a tuple {A, a), where A is one of the structures An (for some A G N) 
and a : {x, y, z} lA-^ is a variable assignment in A. To simplify notation, we will extend 
every assignment a to a mapping a : {x, y, z, min, max} U'^, letting a{min) = min-^ and 
a(max) = max^. For a variable v G {x,y,2;} and an element a G ti-^ we write a[-] to 
denote the assignment that maps f to o and that coincides with a on all other variables. If 
j4 is a set of interpretations and is an F0^(<, succ, min, max)-formula, we write A\= ip 
to indicate that 99 is satisfied by every interpretation in A. 
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In a natural way, we view formulas as finite trees (precisely, as their syntax trees), where 
leaves correspond to the atoms of the formulas, and inner vertices correspond to Boolean 
connectives or quantifiers. We define the size of ip to be the number of vertices of ip's 
syntax tree. 

Definition 2.1. [Succinctness] 

Let Li and L2 be logics, let F be a class of functions from N to N, and let C be a class 
of structures. We say that Li is F -succinct in L2 on C iff there is a function f £ F such 
that for every Li-sentence ipi there is an L2-sentence (p2 of size ||<^2|| ^ /(ll'/'ill) which is 
equivalent to (pi on all structures in C. □ 

Intuitively, a logic Li being F-succinct in a logic L2 means that F gives an upper bound 
on the size of L2-formulas needed to express all of Li. This definition may seem slightly 
at odds with the common use of the term "succinctness" in statements such as "Li is 
exponentially more succinct than L2" meaning that there is some Li-formula that is not 
equivalent to any L2-formula of sub-exponential size. In our terminology we would rephrase 
this last statement as "Li is not 2°("*) -succinct in L2" (here we interpret sub-exponential as 
2''(™) ^ but of course this is not the issue) . The reason for defining F-succinctness the way 
we did is that it makes the formal statements of our results much more convenient. We will 
continue to use statements such as "Li is exponentially more succinct than L2" in informal 
discussions. 

Example 2.2. F0^(<, succ, min, max) is 0(m)-succinct in F0^(<) on the class of linear 
orders, because succ(a;,y) (respectively, x=min, respectively, x=max) can be expressed by 
the formula 

{x < y) A -i3z[{x<z) A {z < y)) 
(respectively, -<3y{y < x), respectively, -i^y {x <y))- □ 

3. Lower bound for FO^ 

3.1. Lower Bound Theorem. Before stating our main lower bound theorem, we need 
some more notation. 

If 5 is a set we write 7^2 (-5) for the set of all 2-element subsets of S. For a finite 
subset S of N we write MAX S (respectively, MIN S) to denote the maximum (respectively, 
minimum) element in S. For integers m,n we define 

dijf{m,n) := m — n 

to be the difference between m and n. We define <-type{m, n) G {<, =, >} as follows: 

if m < n then <-type{m,n) := "<" , 

if m = n then <-type{m,n) := "=" , 

if m > n then <-type{m,,n) := ">". 
We next fix the notion of a separator. Basically, if A and B are sets of interpretations and 
5 is a separator for {A,B), then 6 contains information that allows to distinguish every 
interpretation I E A from every interpretation E B. 

Definition 3.1. [separator] 

Let A and B be sets of interpretations. 

A potential separator is a mapping 

6 : V2 {{min, max, x, y, z}) — > N . 
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6 is called separator for {A,B), if the following is satisfied: For every Z := (A, a) £ A and 
J' := G B there are u,u' G {min,max,x,y,z} with u / u' , such that n'}) ^ 1 

and 

1. <-t?/pe(a(ii), a(u')) 7^ <-type[P{u), P{u')) or 

2. (5({n,n'}) ^ MJiV{|diif(a(u),a(u'))l , M^if(/3(^^), /?(^^')) I } and 

Note that 5 is a separator for {A,B) if, and only if, (5 is a separator for {{I}, {J'}), for all 
I £ A and J £ B. For simplicity, we will often write ^^J) instead of ({X}, 

Let us now state an easy lemma on the existence of separators. 

Lemma 3.2. If A and B are sets of interpretations for which there exists an FO^(<)-formula 
such that A\= "(l) and B \=^'^^ then there exists a separator 6 for {A,B). □ 

Proof: We need the following notation: For a number d G N and an interpretation {A, a) 
choose 

Ord{A,a) : {0, 1, 2, 3, 4} ^ {min, x, y, z, max} 
such that a(^Ord{A, a){i)) ^ a(Or(i(^, a)(i + 1)), for all ^ i < 4. Furthermore, choose 

Distd{A,a) : + : 0^i<4}^{0,..,2'^+^} 

such that the following is true for all ^ -i < 4: 

Distd{A,a){i,i + l) = diff(^a{Ord{A,a){i+l)),a{Ord{A,a){i))y or 

DistdiA,a){i,i + l) = 2"^+^ ^ diff(^a{Ord{A,a){i+l)) ,a{Ord{A,a)ii))y 

Finally, we define the d-type of {A^ a) as 

Typed{A,a) := [Ord{A,a), Distd{A,a)). 
Using an Ehrenfeucht-Fraisse game, it is an easy exercise to show the following (cf., e.g., 
I3J: 

Lemma 3.3. Let d G N and let {A, a) and (13,(3) be interpretations. If Typed{A,a) = 
Type^iB, (3), then {A, a) and {B, j3) cannot be distinguished by FO(<)-formulas of quantifier 
depth f^d. □ 

Let d be the quantifier depth of the formula ip. We define 5 to be the potential separator 
with 5{p) := 2'^"'"^, for all p G V2{{mm, max, x, y, z}). 

To show that 6 is, in fact, a separator for {A,B), let [A, a) G A and {B,(3) G B. Since 
{A,a) \= ■0 and {B,f3) ^ xp, we obtain from Lemma 1531 that Typed{iA,a)) / Type^dB, (3)), 
i.e., 

1. Or<^,a) / Ord{B,(3), or 

2. L'istd(^,a) / DistdiB,(3). 

Therefore, there exist u,u' G {niin,max,x,y,z} with n 7^ u', such that 

1. <-type[a{u),a{u')) 7^ <-type[P{u), (3{u')) , or 

2. di#(a(n),a(n')) / difj{l3iu), (3{u')) and 

<5({n,n'}) =2^+1 ^MJJV{Mi#(a(n),a(n'))|, Mi#(/3(n), /3(n')) |}. 
Consequently, is a separator for {A,B), and the proof of Lemma I.S. 21 is complete. ■ 
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Definition 3.4. [weight of S] 

Let 5 be a potential separator. We define 

(a) the border- distance 

b{6) := MAX { 6{{min, max}), 6{{min,u}) + 6{{u' , max}) : u,u' £{x,y,z} } 

(b) the centre- distance 

c{5) := MAX {6{p)+5{q) : p,q€V2{{x,y,z}), p^q} 

(c) the weight 



w{6) := v/c(5)2 + 6(5). □ 

There is not much intuition we can give for this particular choice of weight function, except 
for the fact that it seems to be exactly what is needed for the proof of our main lower 
bound theorem (Theorem l3.6|) . At least it will later, in Remark l3.131 become clear why the 
^-function is used for defining the weight function. 

Definition 3.5. [minimal separator] 

6 is called a minimal separator for {A, B) if (5 is a separator for (A, B) and 

w{5) = MIJV{ : is a separator for }. □ 

Now we are ready to formally state our main lower bound theorem on the size of F0=^(<)- 
formulas: 

Theorem 3.6. [main lower bound theorem] 

If tp is an F0'^(<, succ, mill, max)-formula, A and B are sets of interpretations such that 
A\= il) and B \= and (5 is a minimal separator for {A, B), then 

IIVII ^ □ 

Before giving details on the proof of Theorem 13.61 let us first point out its following easy 
consequence: 

Corollary 3.7. 

Let n > m ^ 0. The two linear orders Am and An (with universe {0, . . , m} and {0, ■ ■ ,n}, 
respectively) cannot be distinguished by an F0'^(<, succ, min, max)-sentence of size < ^\/m. 

□ 

Proof: Let tj^ be an F0^(<, succ, min, max)-sentence with Am |= and An |= Let a 
be the assignment that maps each of the variables x, y, and z to the value 0. Consider the 
mapping 

Sm '■ V2{{^in, max, x,y, z}) ^ N defined via 
m , if p = {min, max} 
, otherwise. 

It is straightforward to check that w{5m) = \/rn and that 6m is a minimal separator for 

( {Am, a) , (An, a) ) . 
From Theorem 13 . 61 we therefore obtain that 

Ml ^ i-u;(5„) - 1 
This completes the proof of Corollarv 13.71 



2 - vm. 



THE SUCCINCTNESS OF FIRST-ORDER LOGIC ON LINEAR ORDERS 



7 



To prove Theorem 13.61 we need a series of intermediate results, as well as the notion of 
an extended syntax tree of a formula, which is a syntax tree where each node carries an 
additional label containing information about sets of interpretations satisfying, respectively, 
not satisfying, the associated subformula. More precisely, every node v of the extended 
syntax tree carries an interpretation label il{v) which consists of a pair {A, B) of sets of 
interpretations such that every interpretation in A, but no interpretation in satisfies 
the subformula represented by the subtree rooted at node v. Basically, such an extended 
syntax tree corresponds to a game tree that is constructed by the two players of the Adler- 
Immerman game (cf., V). 

For proving Theorem 13.61 we consider an extended syntax tree T of the given formula 
i/j. We define a weight function on the nodes of T by defining the weight w{v) of each node 
V oiT to be the weight of a minimal separator for il(v). Afterwards — and this is the main 
technical difficulty — we show that the weight of each node v is bounded (from above) by 
the weights of v^s children. This, in turn, enables us to prove a lower bound on the number 
of nodes in T which depends on the weight of the root node. 

3.2. Proof of Theorem 13. 6L We start with the formal definition of extended syntax trees. 
Definition 3.8. [extended syntax tree] 

Let be an FO'^ (<, succ, min, max)-formula, let A and B be sets of interpretations such 
that A \= ip and B |= ^ip. By induction on the construction of ip we define an extended 

syntax tree Xj,^'^'^ as follows: 

• li ip is an atomic formula, then T^"^'^^ consists of a single node v that has a syntax label 
sl{v) := ijj and an interpretation label il{v) := {A,B). 

• Ifip is of the form -'ipi, then T^^'^'^ has a root node v with sl{v) := and il{v) := {A, B). 
The unique child of v is the root of T^^'^\ Note that B |= tpi and A |= -'ipi. 

• li ip is of the form ipi V ip2, then T^^'^"^ has a root node v with sl{v) := V and il{v) : = 
(AB). 

The first child of v is the root of xj^^'^K The second child of v is the root of 
where, for i G {1,2}, Ai = {{A,a) G A : {A,a) \= Tpi}. 
Note that A = AiU A2, Ai |= Vi, and B |= ^ipi. 

• li tp is of the form ipi A 1^2, then T^^'^"^ has a root node v with sl{v) := A and il{v) : = 
{A,B). 

The first child of v is the root of 7^^'^^^. The second child of v is the root of 7^^'^^\ 
where, for i G {1, 2}, Bi = {{B, (3) e B : {B, (3) ^ VJ- 
Note that B = Bi U B2, A \= ipi, and Bi |= ^^i. 

• If ^/^ is of the form 3mpi, for a variable u G {x,y,z}, then T^^'^'^ has a root node v 
with sl{v) := 3u and il{v) := {A, B). The unique child of v is the root of 7^^^'^^\ where 
Bi := {{B, I3[^]) : {B, p) e B, be U^}, and Ai is chosen as follows: For every {A, a) e A 
fix an element a£U-^ such that {A, a[f ]) |= ipi, and let Ai := {{A, a[f ]) : {A, a) G A}. 
Note that Ai |= ipi and Bi |= ^ipi. 

• If ^/^ is of the form \/uipi, for a variable u G {x,y,z}, then T^^'^'^ has a root node v 
with s/(t>) := Vn and il{v) := {A, B). The unique child of v is the root of T^^^''^^\ where 
Ai := {{A, a[f ]) : {A, a) £ A, a e U-^}, and Bi is chosen as fohows: For every {B, 13) & B 
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fix an element b £ such that {B,P[^]) |= ^Vi, and let Bi := {{B,a[^]) : {B,l3) £ B}. 
Note that Ai |= ipi and Bi |= -■V'l- D 

The following is the main technical result necessary for our proof of Theorem 13.61 

Lemma 3.9. Let ip be an FO^ (<, succ, miii, max)-formula, let A and B be sets of interpre- 
tations such that A\= il) and B \= -iip, and let T be an extended syntax tree T_^^'^\ 
For every node w of T the following is true, where 5 is a minimal separator for il(v): 

(a) If iJ is a leaf, then w^S) ^ 1. 

(b) If V has 2 children vi and V2, and 6i is a minimal separator for il{vi), for i G {1, 2}, then 
w{6) ^ w{6i) + w{62). 

(c) If V has exactly one child vi, and 6i is a minimal separator for il{vi), then tt;(5) ^ 
w{6i) + 2. □ 

The proof of Lemma 13.91 is given in Section [3.31 below. 

For a binary tree T we write ||T|| to denote the number of nodes of T. For the proof of 
Theorem 13.51 we also need the following easy observation. 

Lemma 3.10. Let T be a finite binary tree where each node v is equipped with a weight 
w{v) ^0 such that the following is true: 

(a) If f is a leaf, then w^v) ^ 1. 

(b) If V has 2 children vi and V2, then w{v) ^ w{vi) + w{v2)- 

(c) If V has exactly one child vi, then w{v) ^ w{vi) + 2. 

Then, ||T|| ^ ^ • w{r) , where r is the root of T. □ 
Proof: By induction on the size of T. 

If T consists of a single node v, then ||T|| = 1^^-1; and 1 ^ 'tt'(^'), since u is a leaf. 
If T consists of a root node v whose first child vi is the root of a tree Ti and whose second 
child V2 is the root of a tree T2, then ||T|| = 1 + UTiH + ||T2||. By induction we know for 
i G {1,2} that ||7i|| ^ ^w{vi). From the assumption we have that w{v) ^ w{vi) +w{v2)- 
Therefore, 

||r|| ^ 1 + + iu;(v2) > ^w{v). 

If T consists of a root node f whose unique child vi is the root of a tree 7i, then ||T|| = 
1 + \ \Ti\\. By induction we know that \\Ti\\ ^ From the assumption we have that 

w{v) ^ w{vi) + 2, i.e., ^w{v) ^ + 1. Therefore, ||T|| ^ 1 + ^w{vi) ^ 

This completes the proof of Lemma 13.101 ■ 

Using Lemma 13.91 and I3.1U1 we are ready for the 

Proof of Theorem IHT^ 

We are given an FO^ (<, succ, niin, max)-formula i/j and sets A and B of interpretations such 
that A\= ip and B |= -itp. Let T be an extended syntax tree T^'^\ 

We equip each node u of T with a weight w{y) := where 5^ is a minimal separator 

for il{v). Prom Lemma 13.91 we obtain that the preconditions of Lemma l3.1(JI are satisfied. 
Therefore, ||T|| ^ ^ • w{r), where r is the root of T, i.e., w{r) = w{6), for a minimal 
separator (5 for il{r) = {A,B). 

From Definition 13.81 it should be obvious that \\ip\\ = \\T\\. Therefore, the proof of Theo- 
rem EISl is complete. ■ 
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3.3. Proof of Lemma 13. 9L We partition the proof of Lemma 13.91 into proofs for the parts 
1^, (0^, and where part ^ turns out to be the most elaborate. 

According to the assumptions of Lemma 13.91 we are given an F0'^{<, succ, min, max)- 
formula ip and sets A and B of interpretations such that A\= ip and B |= ^ip. Furthermore, 
we are given an extended syntax tree T = 7^"^'^^ . Throughout the remainder of this sec- 
tion, T will always denote this particular syntax tree. 

Proof of part of Lemma 13. 9t 

Let t> be a leaf of T and let 5 be a minimal separator for {A^,B^) := il{v). Our aim is to 
show that w{6) ^ 1. 

By Definition 13.81 we know that sl{v) is an atomic formula of the form R(u, u') for 
R G {<,=, succ} and u,u' G {min, max, x , y , z} . Furthermore, Ay |= R{u,u') and B^ \= 
^R{u,u'). I.e., for all (A, a) G A^ and iB,f3) £ B^, 

<-type{a{u),a{u')) / <-type[(3{u), [5{u')) 
or 

\diff{a{u),a{u'))\ = 1 / \diff{fj{u),(3{u'))\. 
In case that u ^ u' we can define a separator 5 for {Ay, By) via 

1 , if p = {u, n'} 
, otherwise. 



hp) 



Since (5 is a minimal separator, we obtain that w{5) ^ w{5) = 1. 

It remains to consider the case where u = u' . Here, Ay |= R{u,u) and By |= -^R[u,u). 
Since R G {<, =, succ} this implies that Ay = % ov By = 0. Therefore, according to Defini- 
tion 13.11 the mapping 5 with 5{p) = 0, for all p € V2{{min,inax,x,y, z}), is a separator for 
{Ay, By). Hence, w{6) ^ w{5) = 0. 

This completes the proof of part ^ of Lemma 13.91 ■ 

The essential step in the proof of part of Lemma 13.91 is the following easy lemma. 

Lemma 3.11. Let v he a, node of T that has two children vi and V2. Let 5i and 82 be 
separators for il{vi) and il{v2), respectively. Let 5 be the potential separator defined on 
every p G V2{{min, max, x, y, z}) via 

5{p) := 5i{p) + 52{p). 
Then, 5 is a separator for il{v). □ 

Proof: Let {A,B) := il{v). We need to show that 5 is a separator for 1(1, J), for all Z £ A 
and J £ B. Let therefore Z := {A, a) £ A and J := {B, P) G B he fixed for the remainder 
of this proof. 

Since v has 2 children, we know from Definition 13.81 that sl{v) = V or sl{v) = A. Let 
us first consider the case where sl{v) = V. 

From Definition 13.81 we know that, for i £ {1,2}, il{vi) = {Ai,B), where AiL) A2 = A. 
Therefore, there is an i G {1,2} such that Z £ Ai. From the assumption we know that 5i 
is a separator for {Ai,B). Therefore, there are u,u' G {mm,max,x,y, z} with u ^ u' , such 
that 5i{{u,u'}) ^ 1 and 

1. <-type{a{u),a{u')) / <-type[P{u), /3{u')) or 
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2. 6{{u,u'}) ^ MJiV{|di#(a(u),a(u'))| , \d^{(i(u) , fi(v!))\} and 
diS{a{u),a{v!)) / diS{fi{u),fi(y!)). 

Since (5({u, li'}) = ^idu, u'}) + (52({u, u'}), we know that (5({n, n'}) ^ u'}). Therefore, 

(5 is a separator for {Z^J). This completes the proof of Lemma for the case that 
sl{v) = V. 

The case sl{v) = A follows by symmetry. ■ 
Using Lemma llH.llL the proof of part (Ef) of Lemma ll-{.9l is straightforward: 
Proof of part ([bp of Lemma 13. 9t 

Let V be a node of T that has two children vi and V2- Let 5, 5i, and respectively, 
be minimal separators for il{v)^ iK^i), and il{v2), respectively. Our aim is to show that 
w{6) ^ w{6i) + 10(62). 

Let 5 be the separator for il{v) obtained from Lemma 13.111 Since (5 is a minimal 
separator for il{v), it suffices to show that wi^S) ^ w{6i) + w{62)- 

Using Definition 13.41 it is straightforward to check that b{6) ^ b{6i) + b{62) and c{6) ^ 
c{6i ) + c{52 ) • From this we obtain that 

w{5)^ = c{5)^ + b{6) 

^ {c{5i) + ci62)f + b{6i) + b{S2) 

= c{6if + + 462)^ + b{d2) + 2461)462) 

^ w{6i)'^ + wi62)^ + 2wi6i)w{62) 

= {w{6i) + w{62))\ 

I.e., we have shown that w{6) ^ w{6i) + 10(62)- 

This completes the proof of part ^ of Lemma 13.91 ■ 

An essential step in the proof of part (fijj of Lemma 13.91 is the following lemma. 

Lemma 3.12. Let v he a node of T that has syntax-label sl(v) = Qu, for Q G {3,V} and 
u £ {x, y, z}. Let 61 be a separator for il(vi), where vi is the unique child of v in T. Let 6 
be the potential separator defined via 

• 6({u,u'}) := 0, for all u' & {min, max, x , y , z} \ {u} , 

• 6({min,max}) := MAX 6i({min, max}) , 6i({min,u}) + 6i({u, max}) + I ^ , 
and for all u' , u" such that {x, y, z} = {u, u', u"} and all m G {min, max}, 

• 6({u',u"}) := MAX{6i({u',u"}), 6i({u' ,u}) + 6i({u,u"}) + I } , 

• 6({m,u'}) := MAX{6i({m,u'}), 6i({m,u}) + 64{u,u'}) + I } . 

Then, 5 is a separator for il(v). □ 

Proof: We only consider the case where Qu = 3z. All other cases Q G {3,V} and u G 
{x,y,z} follow by symmetry. 

Let {A, B) := il(v). We need to show that 5 is a separator for (X, J), for all X G A and 
J G B. Let therefore X := (A,a) G A and J := (13,(3) £ B he fixed for the remainder of 
this proof. The aim is to show that J is a separator for (X, J') . 

Since sl(v) = 3z, we know from Definition 13. 81 that il(vi) = {Ai, Bi), where Bi contains 
the interpretations (B,j3[^]), for all b G U'^ , and Ai contains an interpretation a[|]), for 



THE SUCCINCTNESS OF FIRST-ORDER LOGIC ON LINEAR ORDERS 11 



£l^{{min,z}) ^ 
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B in Case 2: 



Figure 1: Situation in Claim 3. 

a particular a G U'^. We define Oa := a[f]> '■= {A,aa), and for every b G U^, Pb ■= P[^] 
and Jb := (i3,/3fe). 

From the fact that is a separator for 5/(^1), we in particular know, for every b G U^, 
that 5i is a separator for {Za,Jb)- I-e-, we know the following: 

For every b G there are Ub,u'^^ G {min, max, x, y, z} with Ub 7^ u^, such that {Ta,Jb) is 
separated by 6i{{ub,u'jj}), i.e., 6i{{ub,u'^}) ^ 1 and 

(1) ^: <-type{aa{ub),aaiu[)) / <-type{(3b{ub), (3biu[)) , or 

(2) ^: ^ MIJV{|rfi#(a,M,a,K))|, /J^K)) | } and 

diff{aa{ub),Oia{u[)) / diff{Pb{ub),f3b{Ub))- 

In what follows we will prove a series of claims which ensure that 5 is a separator for {I, J') . 
We start with 

Claim 1. If there is a 6 G U'^ such that {Ta,Jb) is separated by (5i({nt,n^) with z ^ 
{ub,u'^}, then 5 is a separator for {I, J). □ 

Proof: As z we have, by definition of 5, that 5({ub,ti^}) ^ (5i({w6, n'^}). There- 

fore, {l)b and (2)f, imply that 5 is a separator for {Za.,Jb) as well as for l^,J). 
This completes the proof of Claim ^ ■ 

Due to Claim^it henceforth suffices to assume that for no 6 G , i^a,Jb) is separated by 
5\{{ub-,u'^}) with z {ub,u'fj}. I.e., we assume that for every b G , {Za,Jb) is separated 
by 5i{{mm,z}), 5i{{z,max}), 6i{{x,z}), or 5i({y,z}). 

Claim 2. If a = a{u) for some u G {min, max, x, y}, then 5 is a separator for (Z^J). □ 

Proof: Choose b := j3{u). Therefore, aa{z) = a = a{u) = aa{u) and I3b{z) = b = j3{u) = 
Pbiu). 

We know that {Za,Jb) is separated by 5i{{z,u'}), for some u' G {miu, max, x, y}. Fur- 
thermore, since aa{z) = aa{u) and (3b{z) = we have u' 7^ n. 

By definition of 5 we know that n'}) ^ 5i({z, ^i'})- Therefore, 5 is a separator for 
{Za.,Jb) as well as for {Z,J). 

This completes the proof of Claim [^1 H 
Due to Claim|21it henceforth suffices to assume that, a 7^ for all u G {miii, max, x, y}. 

Claim 3. If 5i{{inin, z}) ^ difj{a, min'^), then ^ is a separator for {Z,J'). □ 
Proof: We distinguish between two cases. An illustration is given in Figure ^ 
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SI: I \ \ 1 

a(x) a 

B in Case 1: \ \ \ 1 

P(,t) b 

(B in Case 2: I \ 1 i 

Figure 2: Situation in Claim 5 (for the special case that a{x) ^ a). 

Case 1: diff{ma^ , miiP) ^ diff{a,miir^). 
In this case we can choose h G lA^ with 

diff{b, min'^) = diff{a, min'^) 

(simply via b := a). Obviously, {Ia,J^b) is not separated by 6i{{mm, z}). However, we know 

that {Ia,<Jb) is separated by 5i{{z,u'}), for some u' G {x,y,inax}. 

Since 

dijf{a,min-^) +6i{{z,u'}) ^ 5i{{mm, z}) + 6i{{z,u'}) ^ 5({min,n'}), 

it is straightforward to see that {ZajJ'b), and also {I, J'), is separated by 6{{min,u'}). I.e., 
(5 is a separator for {ZjJ'). 

Case 2: diff{max^ , min^) < diff{a,min'^). 
Since 

^({min, max}) ^ 5i{{min, z}) ^ diff{a, min'^) > diff{max^ , min'^), 

we know that {I, J') is separated by 5{{min, max}). I.e., 5 is a separator for {ZjJ'). 

This completes the proof of Claim |31 H 

By symmetry we also obtain the following 

Claim 4. If 6i{{z, max}) ^ diff^max^ , a) , then 5 is a separator for {I, J'). □ 
In a similar way, we can also show the following 

Claim 5. If 5i{{x,z}) ^ \diff{a{x),a)\, then (5 is a separator for (T, J"). □ 
Proof: We distinguish between three cases. An illustration is given in Figure |21 
Case 1: There is ab G lA^ such that diff{(3{x),b) = dijj{a{x) , a) . 

Obviously, {laiJb) is not separated by 5i({rc,z}). However, we know that {laiJb) is sepa- 
rated by (5i({z, u'}), for some u' G {miii,y, max}. 
Since 

\diff{a{x),a)\+5i{{z,u'}) ^ 5i{{x, z}) + 5i{{z,u'}) ^ ^({x,n'}), 

it is straightforward to see that {laiJb)-, and also (^^J)^ is separated by (5({x,u'}). I.e., b 
is a separator for (X, J") . 
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a in Case 1 
\ 



a in Case 3 

\ 



a in Case 2 

\ 



a{x) 



P(x) 



a(y) 



Figure 3: Situation at the beginning of Claim 7. 

Case 2: a{x) ^ a and diff{ma^ ^ I3{x)) < dijf{a,a{x)). 
Since 

6{{x,max}) ^ 5i{{x,z}) ^ diff{a,a{x)) > dijf{max^ , f3{x)), 

we know that {1,J') is separated by 5{{x, max}). I.e., (5 is a separator for {Z,J'). 

Case 3: a < a{x) and diff{P{x), min^) < diff{a{x),a). 
This case is analogous to Case 2. 

Now the proof of Claim |S1 is complete, because one of the three cases above must apply. ■ 
By symmetry we also obtain the following 

Claim 6. If 5i{{y,z}) ^ \diff{a{y),a)\, then 5 is a separator for (^^J). □ 
Finally, we show the following 

Claim 7. If none of the assumptions of the Claims is satisfied, then 5 is a separator 
for(J,:7). □ 

Proof: We assume w.l.o.g. that min"^ ^ a{x) ^ ^ max^. 

Since Claim ^ does not apply, we know that also min^ ^ (i{x) ^ ^ max^. 

Since Claims ISHHl do not apply, we furthermore know that 

1. \dijf{a{u'),a)\ > 5i{{u',z}), for all G {min, max, x, y}, and 

2. min'^ < a < a{x) or a{x) < a < a{y) or ct{y) < a < max^. 

We distinguish between different cases, depending on the particular interval that a belongs 
to. An illustration is given in Figure |31 

Case 1: miir^ < a < a{x). 

Case 1.1: diff{P{x), min^) ^ 6i{{min, z}) . 

By definition of ^ we have diff{(3{x) , min^) ^ 5{{min,x}). Since 

dijj{a{x) , min"^) > dijj{a,iniir^) > 6i{{min, z}) ^ diff{(3{x), min^), 
we therefore know that (X, J') is separated by 5{{min, z}). I.e., 5 is a separator for (X, J'). 
Case 1.2: diff{f3{x),miiP) > 5i{{mm, z]) . 

In this case we can choose b ^ I3{x) such that diff{b, min^) = 5i{{min, z})+l. An illustration 
is given in Figure 0] Then, {la^Jh) is not separated by 5i({min, z}). However, we know 
that (Xa, Jb) is separated by 5i{{z, u'}), for some u' G {x, y, max}. From the assumptions of 
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I « a(x) 

®; I H \ 

: b 13(x) 

I ^ 

8({inin,z}) 



Figure 4: Situation in Case 1.2 of Claim 7. 



a{x) a a(y) 

I i 

M{x,z}) 

Figure 5: Situation in Case 3.2 of Claim 7. 

Claim[7|we know that diJJ{a{u'),a) > 6i{{z,u'}). Hence we must have that dijf{f3{u'),b) ^ 
6i{{z,u'}). Therefore, 

diffif3iu'), min^) = diff{f3{u'), b) + diff{h, miiP) 

^ 5i{{z,u'])+5i{{mm,z]) + l 

^ (5({min, n'}). 

Since 

diff{a{u), min'^) = diff(a{u),a) + diff{a, min'^) > dijj{f3(u), min^), 

we hence obtain that 5 is a separator for (T, J) . 

Case 2: a{y) < a < max^. 
This case is analogous to Case 1. 

Case 3: a{x) < a < a{y). 

Case 3.1: diff(0(y), 0(x)) ^ d^(ix,z}). 

By definition of 5 we have diff{P{y), (3{x)) ^ 5{{x,y}). Since 

dijf{a{y),a{x)) > diff{a,a{x)) > 6i{{x,z}) ^ diff{P{y), l3{x)), 
we therefore know that {I, J) is separated by 6{{x,y}). I.e., 5 is a separator for {I, J'). 
Case 3.2: diff{(){y) , p{x)) > 5i{{x , z]) . 

In this case we can choose h with f3{x) < b ^ f3{y) such that dijj{b, (3{x)) = 6i{{x,z}) + 1. 
An illustration is given in FigureEl Then, {Ia,Jb) is not separated by 6i{{x,z}). However, 
we know that {Za,Jb) is separated by 5i({z,u'}), for some u' G {mm, y, max}. From the 
assumptions of Claim [3 we know that \diff{a.,a{u'))\ > 6i{{z,u'}). Hence we must have 
that \dijj{b, (3{u'))\ ^ 5i{{z,u'}). We now distinguish between the cases where u' can be 
chosen from {y,max}, on the one hand, and where u' must be chosen as min, on the other 
hand. 
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a{x) a a(y) 



b bj f.iy) 



i §,toz}) 6,({Z,3'}) 

^^({min,z}} 

Figure 6: Situation in Case 3.2.2 of Claim 7. 

Case 3.2.1: u' G {y, max}. 
In this case, 

dz#(/3K),/3(x)) = dtMl3{u),b) + dtMb,(3{x)) 
^ 5ii{z,u'}) + 6ii{x,z}) + l 
< ~6i{x,u'}). 

Since 

diff{a{u'),a{x)) = diff{a{u),a) + diff{a,a{x)) > dijJ{f3{u),P{x)), 
we hence obtain that (5 is a separator for {I, J') . 
Case 3.2.2: u' ^ {y,max}. 

In this case, {Ia,J^b) is separated by 5i{{min, z}), and we may assume that it is neither 
separated by 5i{{z,y}) nor by 5i{{z, max}) nor by 5i{{x,z}). In particular, we must have 
that 

diffiPiy),b) ^ 6ii{z,y}) + l. 

Therefore, for every b' with 

b ^ b' < I3iy)-5i{{z,y}), 

the following is true: {Xa,Jb>) is neither separated by nor by but, 

consequently, by 5i{{inin,z}) or by 5i{{z , max}) . Let bi be the largest such b' for which 
{Ta,Jb') is separated by 6i{{mm,z}). In particular, 

diff{bi,mivP) ^ 5i{{mm,z}). 

An illustration is given in Figure IHl 

Case 3.2.2.1: diffiPiy),bi+l) ^ 5i{{z,y}). 
In this case we know that 

diff{P{y),mm^) ^ 5i{{z,y}) + I + 5i{{mm, z}) ^ ~5{{mm,y}). 

Furthermore, 

diff{a{y) , min-^) = diff{a{y) , a) + diff{a, min-^) 

^ 5i{{z,y}) + l + 5i{{min,z}) + l. 

Therefore, 

diff{a{y),mm^) / dijf{l3{y), min^), 
and {I, J') is separated by 6{{mm,y}). I.e., 5 is a separator for {I, J'). 
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Case 3.2.2.2: diff{Piy),bi+l) > 6i{{z,y}). 

In this case we know (by the maximal choice of bi) that {la, J'bi+i) must be separated by 
6i{{z , max}) . In particular, diff{max^,bi+l) ^ 5i{z, max). Therefore, 

diff{max^ , min'^) ^ 5i{{z, max}) + 1 + 6i{{min, z}) ^ 6{{min, max}). 

Furthermore, 

diff{max^ , min"^) ^ diff{max^,a) + diff{a,min'^) 

^ 6i{{z,max}) + l + 6i{{min,z}) + 1. 

Therefore, 

diff{max:^ , miir^) ^ dijj{max^ , min^), 
and {I, J') is separated by 5{{mm, max}). I.e., (5 is a separator for {Z,J'). 

We now have shown that (5 is a separator for (T, J), if Case 3 applies. 

Together with the Cases 1 and 2 we therefore obtain that the proof of Claim [3 is complete. 

■ 

Since at least one of the Claims 00 must apply, the proof of Lemma 13.121 finally is com- 
plete. ■ 

Proof of part of Lemma 13. 9t 

Let be a node of T that has exactly one child vi. Let 5 be a minimal separator for 
il{v), and let 5i be a minimal separator for {Ai,Bi) := il{vi). Our aim is to show that 
w{5) ^ w{5i) + 2. 

From Definition 13.81 we know that either sl{v) = or sl{v) = Qn, for some Q G {3,V} 
and u £ {x, y, z}. 

Case 1: sl{v) = 

In this case we know from Definition 13.81 that il[v) = {Bi,Ai). Therefore, 5i also is a 
(minimal) separator for il{v). In particular, w{5) = w{5i) ^ ^^(^i) + 2. 

Case 2: sl{v) = Qu 

In this case let 6 be the separator for il{v) defined in Lemma 13.121 Since (5 is a minimal 
separator for il{v), it suffices to show that w^S) ^ w{6i) + 2. 

Let u',u" be chosen such that {x,y,z} = {u,u',u"}. Using Definition 13.41 and the 
particular choice of 5, it is straightforward to see that 

c{S) = ~5{{u',u"}) ^ c(5i) + l (3.1) 

and that 

5{{min,max}) ^ b{di) + 1 . (3.2) 
Furthermore, for arbitrary n, u' G {x, y, z} we have 

5{{min,u}) + 6i{u',max}) ^ b{6i) + 2c{6i) + 2 , (3.3) 

which can be seen as follows: li u = u or u' = u, then 6{{min,u}) = or 6{{u' , max}) = 
0. 

Consequently, 

6{{min,u}) + 6{{u',max}) ^ MAX{b{5i), b{5i) + c{6i) + 1} . 
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MAX < 



If u and u' both are different from u, then 

6{{min,u}) = MAX { 5i{{mm,u}), 6i{{min,u}) + 5i{{u,u}) + I } 

and 

^({tt',max}) = MAX { 6i{{u',max}), 6i{{u, max}) + 6i{{u,u'}) + 1 } . 

Therefore, 6{{min,u}) + 6{{u' , max}) ^ 

5i{{min,u}) + 5i{{u' , max}), 

5i{{min,u}) + 6i{{u,max}) + 6i{{u,u'}) + 1, 

6i{{min,u}) + 6i{{u,u}) + 1 + 6i{{u' , max}), 

6i{{min,u}) + 6i{{u,u}) + I + 6i{{u,max}) + 6i{{u,u'}) + 1 

which, in turn, is less than or equal to 

MAX { + c(r5i) + 1, bi6i) + 2c{6i) + 2 } . 

I.e., we have shown that (|3.3() is valid. 
From (|3.2|) and we obtain 

bCS) ^ 6((5i) + 2c(5i) + 2. (3.4) 

From (|3.1I) and (|3.4j) we conclude that 

w{6)^ = c{6)^ + b{6) 

< (c(5i) + l)^ + 6(5i) + 2c(5i) + 2 

= c(5i)2 + 6(5i) + 4c(Ji) + 3 

^ + 4w(Ji) + 4 

= (u;(5i) + 2)'. 

Therefore, w{5) ^ ^((Ji) + 2. 

This completes the proof of part ^ of Lemma 13.91 ■ 

Remark 3.13. From the above proof it becomes clear, why Definition 13.41 fixes the weight 
of a separator by using the ^^-function. Let us consider the, at first glance, more straight- 
forward weight function ■w{6) := MAX {c{5), b{6)}. In the proof of part f^) of Lemma 13.91 
we then obtain from the items (|TT|) and that w{6) ^ 2c(5i) + b{5i) + 2 ^ 3uj{5i) + 2. 
Therefore, a modified version of Lemma I3.1UI where item (c) is replaced by the condition 
"// u has exactly one child vi, then w{v) ^ 3w(fi) + 2", leads to a (much weaker) 
bound of the form ||T|| ^ c • \g{w{v)). This, in turn, leads to a weaker version of Theo- 
rem EH stating that H^/"!! ^ c • Ig {w{5)). However, this bound can already be proven by 
a conventional Ehrenfeucht-Fraisse game and does not only apply for FO^(<)-formulas but 
for FO(<)-formulas in general and therefore is of no use for comparing the succinctness of 
FO^ and FO. □ 
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4. FO^ vs. FO^ 

As a first application, TIieorem l3.6l allows us to translate every FO^-sentence "0 into an 
FO^-sentence x that is equivalent to if: on linear orders and that has size polynomial in the 
size of "ip. To show this, we use the following easy lemmas. 

Lemma 4.1. Let (p be an F0(<, succ, min, max)-sentence, and let d be the quantifier depth 
of (p. For all N ^ 2'^^^, An |= 9? if, and only if, .Ajd+i |= ^. □ 

A proof can be found, e.g., in the textbook 

Lemma 4.2. For all ^ G N there are FO^(<)-sentences xi a^id X'^i of size 0{l) such that, 
for ah iV G N, An |= Xh (respectively, x^t) \Q N = £ (respectively, N ^ i). □ 

Proof: We choose 

X'^o(^) := {^ = x), 

and, for all i ^ 0, 

x'^£+i{x) ■■= 3y {y < x) A x^e{y)- 
Obviously, for all G N and all a G {0, . . , N}, we have An \= x'^^(o) iff a ^ Therefore, 
for every £ G N, we can choose x^e ■= ^^x'>f{x) and xe ■= X^i ^ ~^X^e+i- ^ 

Theorem 4.3. 

On linear orders, F0'^(<, succ, iliin, iiiax)-sentences are C'(m^)-succinct in FO^(<)-senten- 
ces. □ 

Proof: Let ■0 be an F0^(<, succ, min, max)-sentence. Our aim is to find an FO^(<)-sentence 
X of size OGIV'in such that, for all A^ G N, ^at ^ x iff -^tv |= V'- 

If ip is satisfied by all linear orders or by no linear order, x can be chosen in a straight- 
forward way. In all other cases we know from Lemma l4.ll that there exists a D G N such 
that either 

(1.) Ad 1= Tp and, for all N > D, An^iP, or 
(2.) Ao^i^ and, for all N > D, An |= i^- 

In particular, ip is an FO^-sentence that distinguishes between the linear orders Ad and 
Ad+i- From Corollarv 13 . 71 we therefore know that \\ip\\ ^ \^/D. 

We next construct an FO^-sentence x equivalent to ip: Let Xi and x^i be the F0^(<)- 
sentences from Lemma l4.2l Let x' be the disjunction of the sentences xi for all those i ^ D 
with Ae 1= ip. Finally, if ^d+i |= ip, then choose x '■= x' X^D+i] otherwise choose x '■= x'- 
Obviously, x is an FO^(<)-sentence equivalent to ip, and 

D+l 

11x11 = oiY,^) = o{D') = om\')- 

1=0 

This completes the proof of Theorem 14.31 ■ 

5. FO^ vs. FO 

Using Theorem l3.6l we will show in this section that there is an exponential succinctness 
gap between FO and FO^ on linear orders. 
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Lemma 5.1. For every F0(<, succ, mm, inax)-sentence ip there is an FO^(<)-sentence ■i/' 
of size IIV'II ^ 2*-'(ll'^ll) which is equivalent to on the class of linear orders. □ 

Proof: Let ip be an F0(<, succ, min, max)-sentence, and let d be the quantifier depth of p. 
In particular, \\p\\ ^ d. From Lemma l4.ll we know that, for all N ^ 2'^"'"^, An |= P> if, and 
only if, Agd+i \= p- 

We use, for every £ G N, the sentences and of Lemma 14.21 Let -0' be the 
disjunction of the sentences for all I < 2'^^'^ such that Ae |= p. 

Finally, if A2d+i |= p, then choose := tp' y x^2'*+i; otherwise choose tp := -0'. Obviously, 
'0 is an FO^(<)-sentence equivalent to p, and 

2d+l 

IIV-II = = 0(22(^+i)) = 2^(11^11). 

i=Q 

m 

Lemma 5.2. For all m G N there are FO'*(<)-sentences pm and sets Am and of 
interpretations, such that A^ \= pm, Bm \= ~'Pm, WfmW = 0{m), and every F0^(< 
, succ, min, max)-sentence ipm equivalent to pm has size HV'mll ^ 22™-"^. □ 

Proof: For every G N let on '■ {min, x, y, z, max} {0, . . , N} be the assignment with 
a^ix) = Oat (min) = and aN{y) = CK]\f{z) = a{max) = N. 

For every m G N we choose Am ■= {(^2^102™)} and Bm ■= {{A2^+i,a2"^+i)}- 

Step 1: Choice of pm- 

Inductively we define FO^(<)-formulas Pm{x,y) expressing that \dijj{x,y)\ = 2"^ via 

f'mi^^y) ■= 3z Vn (-u = a; V n = y) ^ Pm-i{z,u) 
(and pQ{x,y) chosen appropriately). 

It is straightforward to see that | \Pm\ \ = 0{m) and that Pm{x, y) expresses that | diff{x, y)\ = 
Therefore, 

ipm ■■= 3x3y p'^{x,y) A -^3 z {z<x V y<z) 
is an FO^(<)-sentence with the desired properties. 
Step 2: Size of equivalent ¥0^ -sentences. 

For every m G N let ipm be an F0^(<, succ, min, max)-sentence with Am |= ipm and Bm \= 
-^ipm- From Corollarv 13.71 we conclude that 

ll^^ll ^ \V2^ = 2^^-\ 

This completes the proof of Lemma 15.21 ■ 

From Lemma l5. II and l5. 21 we directly obtain 

Theorem 5.3. 

On the class of linear orders, FO(<)-sentences are -succinct in FO^(<)-sentences, but 
already FO^(<)-sentences are not 2°^™"^ -succinct in FO^(<)-sentences. □ 

Note that the relation succ and the constants min and max are easily definable in F0^(<) 
and could therefore be added in Theorem 15.31 
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6. FO vs. MSO 

In this section we compare the succinctness of FO and the FO-expressible fragment of 
monadic second-order logic (for short: MSO). This section's main result is a non-elementary 
succintness gap between FO and the FO-expressible fragment of MSO on the class of linear 
orders (Theorem 16. 5() . 

The main idea for proving this succinctness gap is to encode natural numbers by strings 
in such a way that there are extremely short MSO-formulas for "decoding" these strings 
back into numbers. The method goes back to Stockmeyer and Meyer |161ll5j : the particular 
encoding used in the present paper has been introduced in |[5j. To formally state and prove 
this section's main result, we need some further notation: 

We write MonE} for the class of all MSO-formulas that consist of a prefix of existential 
set quantifiers, followed by a first-order formula. By 3XF0 we denote the fragment of 
MonSj with only a single existential set quantifier. 

Let Tower : N ^ N be the function which maps every /i G N to the tower of 2s of height 
h. I.e., Tower{0) = 1 and, for every /i e N, Tower{h+l) = 2'^'""<^\ 

We use the following notations of |S] : 
For h ^ 1 let E/i := {0, 1, <1>, </l>, . . , <h>, </h>}. The "tags" <i> and </i> represent 
single letters of the alphabet and are just chosen to improve readability. For every n ^ 1 let 
L{n) be the length of the binary representation of the number n— 1, i.e., L{Q) = 0, L{\) = 1, 
and L[n) = [log(n— 1)J -|- 1, for all n ^ 2. By bit(z,n) we denote the i-th bit of the binary 
representation of n, i.e., bit(«,n) is 1 if |_^J is odd, and bit(i,n) is otherwise. 
We encode every number n G N by a string Hhin) over the alphabet S/^, where fj,h{n) is 
inductively defined as follows: 

|Ui(0) := <1> </l> , and 

fii{n) := <1> bit(0,n-l) bit(l,n-l) ••• bit(L(n)-l, n-1) </l>, 
for n ^ 1. For h ^ 2 we let 

fJ-hiO) ■= <h> </h> , and 
l^h{n) ■■= <h> 

fih-iiO) bit(0,n-l) 
bit(l,n-l) 

fih-iiL{n)-l) bit(L(n)-l,n-l) 
</h>, 

for n ^ 1. Here, empty spaces and line breaks are just used to improve readability. 
For /i G N let F := Tower{h). Let := S/j+i U {•}, and let 

Vh := <h+l> ^^,(0) • fih{l) • ■■■ f^hiH-l) • </h+l>. 

We consider the string-language {vh)^, containing all strings that are the concatenation of 
one or more copies of Vh. Let Wh be the (unique) string in {vh)~^ that consists of exactly 2^ 
copies of Vfi- 

We write Th for the signature that consists of the symbol < and a unary relation symbol 
Pa, for every o" € S*. Non-empty strings over S* are represented by r/i-structures in the 
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usual way (cf., e.g., jSj). We will shortly write w \= if to indicate that the r/j-structure 
associated with a E*-string w satisfies a given r/j-sentence ^p. 

The following lemma is our key tool for proving that there is a non-elementary suc- 
cinctness gap between FO and the FO-expressible fragment of MSO. 

Lemma 6.1. For every h £ N there is an 3X FO(r/i)-sentence of size 0{h'^), such that 
the following is true for all strings w over the alphabet S*: w |= iff ii; = tt;/i. □ 

For proving Lemma l6. II we need the following: 

Lemma 6.2 (0 Lemma 8]). For all /i G N there are FO(T/i)-formulas equali^{x,y) of size^ 
0{h) such that the following is true for all strings w over alphabet E/j, for all positions a, b 
in w, and for all numbers m,n G {0, . . , Tower{h)}: If a is the first position of a substring 
u of w that is isomorphic to /i/i(m) and if b is the first position of a substring v of w that 
is isomorphic to ^J-hi'^), then w |= equalf^{a, b) if, and only if, m = n. □ 

Using the above lemma, it is an easy exercise to show 

Lemma 6.3. For every /i G N there is an FO(r/i)-formula inch{x,y) of size 0{h) such that 
the following is true for all strings w over alphabet S/j, for all positions a,b in w, and for 
all numbers m,n £ {0, . . , Tower(h)}: If a is the first position of a substring u of w that is 
isomorphic to and if b is the first position of a substring v of w that is isomorphic to 

Hh{n), then w |= inch{a,b) if, and only if, m+1 = n. □ 

We also need 

Lemma 6.4. For every /i G N, the language (vh)^ is definable by an FO(T/i)-sentence ^(vh)+ 
of size 0{h?). I.e., for all strings w over the alphabet S* we have w |= ^{vh)+ if; aiid only 

if, w e □ 

Proof: The proof proceeds in 2 steps: 

Step 1: Given j ^ 1, we say that a string w over S* satisfies the condition C(j) if, and 
only if, for every position x (respectively, y) in w that carries the letter <j> (respectively, 
</j>) the following is true: There is a position y to the right of x that carries the letter 
</j> (respectively, a position x to the left of y that carries the letter <j>), such that the 
substring u of w that starts at position x and ends at position y is of the form fJ,j{n) for 
some n G {0, . . , Tower{j) — l}. 

We will construct, for all j £ {1, . . , h}, FO(r/i)-sentences okj of size 0{j) such that the 
following is true for all j ^ h and all strings w over S* that satisfy the conditions C(j') for 
aU / < j: 

w 1= okj <^=^ w satisfies the condition C{j). 
Simultaneously we will construct, for all j G FO(r/i)-sentences maxj{x) of size 

0{j) such that the following is true for all j ^ h, all strings w that satisfy the conditions 
C(l), . . , C{j), and all positions x in w: 

w 1= maxj{x) <^=^ X is the starting position of 

a substring of w of the form 

;Uj(row;er(j)-l)- 

"'^In an additional factor Ig h occurs because there a logarithmic cost measure is used for the formula 
size, whereas here we use a uniform measure. 
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For the base case j = 1 note that Tower{l) — l = 1 and, by the definition of //i(n), /ii(0) = 
<1> </l> and /xi(l) = <1> </l>. It is straightforward to write down a formula oki that 
expresses the condition C(l). Furthermore, maxi{x) states that the substring of length 3 
starting at position x is of the form <1> </l>. 

For j > 1 assume that the formula maxj-i has already been constructed. For the con- 
struction of the formula okj we assume that the underlying string w satisfies the conditions 
C(l),..,C(i-l). 

The formula okj states that whenever x (respectively, y) is a position in w that carries 
the letter <j> (respectively, </j>) the following is true: There is a position y to the right 
of X that carries the letter </j> (respectively, a position x to the left of y that carries the 
letter <j>), such that the substring u of w that starts at position x and ends at position y 
is of the form ^j(n) for some n G {0, . . , Tower{j) — l}, i.e., 

1. the letters <j> and </j> only occur at the first and the last position of n, 

2. whenever a position x' carries the letter </j-l>, position x'+l carries the letter or 1, 
and position x'+2 carries the letter <j-l> or the letter </j>, 

3. either u = <j> </j>, or the prefix of length 3 of n is of the form <j> <j-l> </j-l>, 

4. whenever x' and y' are positions in u carrying the letter <j-l> such that x' < y' and 
no position between x' and y' carries the letter <j-l>, the formula incj-i{x' ,y') from 
Lemma 1^31 is satisfied, 

5. if the rightmost position x" in u that carries the letter <j-l> satisfies the formula 
maxj-i{x"), then there must be at least one position x'" in u that carries the letter 
such that x"'—l carries the letter </j-l>. 

Note that items 1.-4. ensure that u is indeed of the form Hjin), for some n G N. Item 5 
guarantees that n £ {0, . . , Tower{j) — l} because of the following: recall from the definition 
of the string Hj{n) that Hj{n) involves the (reverse) binary representation of the number 
n—1. In particular, for n := Tower{j) — l, we need the (reverse) binary representation of 
the number Tower{j)—2, which is of the form Oil • • • 11 and of length Tower{j—l), i.e., its 
highest bit has the number Tower{j—l) — l. 

It is straightforward to see that the items 1.-5. and therefore also the formula okj can 
be formalised by an FO(r^)-formula of size 0{j), and that this formula exactly expresses 
condition C{j). 

Furthermore, the formula maxj{x) assumes that x is the starting position of a substring 
w of of the form Hj{n), for some n S N; and maxj{x) states that 

1. the (reverse) binary representation of n, i.e., the {0, l}-string built from the letters in u 
that occur directly to the right of letters </j-l>, is of the form Oil • • • 11, and 

2. the highest bit of n has the number Tower{j—l) — l, i.e., the rightmost position y in u 
that carries the letter <j-l> satisfies the formula maxj_i(y). 

Obviously, maxj{x) can be formalised in FO(r/i) by a formula of size 0{j). Finally, this 
completes Step 1. 

Step 2: A string w over E' belongs to the language {vh}'^ if, and only if, all the following 
conditions are satisfied: 

1. w satisfies oki A ■ ■ ■ A okh, 

2. the first position in w carries the letter <h+l>, the last position in w carries the letter 
</h+l>, the letter <h+l> occurs at a position x > 1 iff position x—1 carries the letter 
</h+l>, and the letter • occurs at a position x iff position x—l carries the letter </h> 
and position x+1 carries the letter <h> or </h+l>, 
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3. whenever x (respectively, y) is a position in w that carries the letter <h+l> (respectively, 
</h+l>) the following is true: There is a position y to the right of x that carries the 
letter </h+l> (respectively, a position x to the left of y that carries the letter <h+l>), 
such that the substring u oi w that starts at position x and ends at position y is of the 
form f/j, i.e., 

-k the letters <h+l> and </h+l> only occur at the first and the last position of n, 

★ the prefix of length 3 of n is of the form <h+l><h></h>, and the suffix of length 3 of 
u is of the form </h> • </h+l>, 

★ whenever x' and y' are positions in u carrying the letter <h> such that x' < y' and 
no position between x' and y' carries the letter <h>, the formula inch{x' ,y') from 
Lemma 16.31 is satisfied, 

-k the rightmost position x" in u that carries the letter <h> satisfies the formula maxh{x"). 
Using the formulas constructed in Step 1 and the preceding lemmas, it is straightforward 
to see that this can be formalised by an FO(r/i)-formula of size 0{h?'). This finally 

completes the proof of Lemma 16.41 ■ 

Finally, we are ready for the 

Proof of Lemma 16. H 

To determine whether an input string w is indeed the string Wh, one can proceed as follows: 
First, we make sure that the underlying string w belongs to {v^)'^ via the FO(r/i)-formula 
Lemma 16.41 Afterwards we, in particular, know that in each <h+l> • • • </h+l>- 
block is of the form and therefore contains exactly H positions that carry the letter 
• . Now, to each •-position in w we assign a letter from {0, 1} in such a way that the 
{0, l}-string built from these assignments is an H-numbering, i.e., of one of the following 
forms: 

1. BINh(O) BINh(1) BIN/^(2) • • • BINH(n), for some n < 2^, 

2. BINh(0)BINh(1)---BINh(2^-1) (^BINi^(O)™), for some m ^ 0. 

Here, BIN;/ (n) denotes the reverse binary representation of length H of the number n < 2^ . 
For example, BIN4(2) = 0100 and BIN4(5) = 1010. Of course, w is the string Wh, i.e., 
consists of exactly 2^ copies of v^, if and only if the i?-numbering's assignments in the 
rightmost copy of form the string BIN// (2^ — 1), i.e., if and only if every •-position in 
this copy of Vh was assigned the letter 1. 

One way of assigning letters from {0, 1} to the •-positions in w is by choosing a set X of 
•-positions with the intended meaning that a •-position x is assigned the letter 1 x £ X 
and the letter if x ^ X. 

Using the FO(T/j)-formulas equalf^ of Lemma l6.2l and ^(vh)+ of Lemma l6.4l it is straight- 
forward to construct the desired 3X FO(T/j)-formula of size 0{h). 

This completes the proof of Lemma l6. II ■ 

As a consequence of Lemma l6. II and Lemma l4. II one obtains a non-elementary succinctness 
gap between FO and MSO on the class of linear orders: 

Theorem 6.5. 

The FO(<)-expressible fragment of MonS} is not Tot(;er(o(-ym))-succinct in F0(<) on the 
class of linear orders. □ 
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Proof: Recall that, for every N gN, An denotes the linear order with universe {0, . . , N}. 

For every /i S N let i{h) := \wh\ — 1, where \wh\ denotes the length of the string Wh- 
We say that a sentence x defines the linear order A^^h) if) a^id only if, A£(^h) is the unique 
structure in {An '■ N G N} that satisfies x- For every /i E N we show the following: 
(a) Every FO (<, succ, min, max)-sentence iph that defines Ai(^h) has size \ \iph\ \ ^ Tower{h). 
(a) There is a MonS}(<)-sentence of size \\^h\\ = 0{h?) that defines A^^^y 
Ad (a): 

Since Wh consists of 2 copies of v^, we know that l{h) ^ 2'^°'^^'^'^\ Therefore, every 
FO(<)-sentence V'/i that defines A£(^h) has quantifier depth, and therefore size, at least 
Tower{h) (cf., Lemma l4.1[l . 

Ad (h): 

Let 3X (fh be the 3X FO(r/i)-sentence obtained from Lemma Ifi.ll It is straightforward to 
formulate an FO(r/j)-sentence ^/j of size 0{h?) which expresses that every element in the 
underlying structure's universe belongs to exactly one of the sets Pq-, for o" G S*. 
The MonS|(<)-sentence 

expresses that the nodes of the underlying linear order can be labelled with letters in S* in 
such a way that one obtains the string Wh. Such a labeling is possible if, and only if, the 
linear order has length Le., ^'/j defines A£(^h)- Furthermore, ||^h|| = 0{h?)^ because 

llall = 0(/i2) and \\^h\\ = 0{h''). 

This completes the proof of Theorem 16. 51 ■ 

Let us remark that by modifying the proof of the above result, one can also show that the 
FO(<)-expressible fragment of monadic least fixed point logic, MLFP, is non-elementarily 
more succinct than F0(<) on the class of linear orders. 

7. Conclusion 

Our main technical result is a lower bound on the size of a 3-variable formula defining 
a linear order of a given size. We introduced a new technique based on Adler-Immerman 
games that might be also useful in other situations. A lot of questions remain open, let us 
just mention a few here: 

• Is first-order logic on linear orders poly(m)-succinct in its 4-variable fragment, or is there 
an exponential gap? 

• As a next step, it would be interesting to study the succinctness of the finite- variable 
fragments on strings, that is, linear orders with additional unary relation symbols. It 
is known that on finite strings, the 3-variable fragment of first-order logic has the same 
expressive power as full first-order logic. Our results show that there is an at least 
exponential succinctness gap between the 3-variable and the 4-variable fragment. We do 
not know, however, if this gap is only singly exponential or larger, and we also do not 
know what happens beyond 4 variables. 

• Another interesting task is to study the succinctness of various extensions of (finite vari- 
able fragments of) first-order logic by transitive closure operators. 
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• It also remains to be investigated if our results can possibly help to settle the long standing 
open problem of whether the 3-variable and 4-variable fragments of first-order logic have 
the same expressive power on the class of all ordered finite structures. 
Finally, let us express our hope that techniques for proving lower bounds on succinctness 
will further improve in the future so that simple results such as ours will have simple proofs! 
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